Connecting genes to metabolites by a systems biology approach.
نویسندگان
چکیده
P lants are of pivotal importance to sustain life on Earth because they supply oxygen, food, energy, and many valuable metabolites. All plant constituents, including secondary metabolites, some of which are used as flavors, fragrances, colorants, or pharmaceuticals, are ultimately derived from primary products of photosynthesis through multiple enzymatic steps encoded by the genome of each plant. However, our knowledge of how both primary and secondary metabolites are synthesized and which genes are involved is far from complete. A better understanding of metabolite synthesis and the regulation thereof will be increasingly important for improving the sustainability and efficiency of useful plant production. Recently, the availability of entire genome sequences of Arabidopsis thaliana and rice and the development of functional genomics tools have allowed the elucidation of metabolite syntheses by a systems biology approach (1–3). The mining and exploitation of the data obtained from genomics and the related research areas of genomewide transcriptomics, proteomics, and metabolomics will bring us into a new era of understanding of biological systems (Fig. 1). Sulfur is one of the essential nutrients for all plants, required to synthesize the key amino acids cysteine and methionine, which in turn are needed to produce functional proteins and many secondary metabolites (4). Approximately 90% of the reduced S is bound through these amino acids. Sulfur is also needed in the functional groups of coenzymes, such as biotin and CoA. Many parts of the world have low contents of sulfur in the soil. Although plants have adapted to live under sulfur-deficient conditions, the knowledge of how this adaptation is accomplished has been rather limited (5). Obviously, a better understanding of the mechanisms underlying this adaptation will allow us to improve crop yields in poor soils. In this issue of PNAS, Hirai et al. (6) show significant progress exploring cellular processes by combining genomewide transcriptomics and metabolomics under deficiency of sulfur and nitrogen in the model plant A. thaliana. This contribution is one of the very first articles successfully bringing our current knowledge closer to understanding the important link between genomic data and the function of metabolites in plants. DNA array transcriptome analysis was combined with metabolite profiling and more specific targeted quantitative analysis. Both ‘‘-omics’’ approaches generated a huge amount of data. The authors had to develop novel bioinformatics tools to integrate the data sets and to generate gene-to-metabolite networks. Although the article mainly deals with primary metabolism, interesting conclusions were also obtained for secondary S-containing health beneficial metabolites, such as glucosinolates and alliins. An approach similar to that of Hirai et al. (6), but focusing entirely on secondary metabolites, was introduced recently by Goossens et al. (7), who combined cDNAamplified fragment length polymorphism (AFLP) transcript profiling with targeted metabolite analysis to map the biosynthetic genes involved in alkaloid metabolism. Because sequence information for many medicinal plants is very limited, the cDNA-AFLP transcript profiling provided a very powerful tool to identify many candidate genes involved in the production of secondary metabolites. Functional analysis of these candidate genes will generate a lot of data and might help to find not only the biosynthetic genes of a particular plant pathway but also master regulators such as transcription factors that are involved in plant secondary metabolism in general. Such information is crucial for applications to overcome, e.g., the low product yield using cultivated plant cell cultures (8). The general problems encountered when characterizing the plant metabolome are the highly complex nature and the enormous chemical diversity of the compounds. The metabolome cannot be computed from the genome (9). Plants produce 200,000 metabolites (10), many of which play specific roles in allowing adaptation to specific ecological niches. It has been estimated that 25–30% of the genes of Arabidopsis encode enzymes of metabolism (1). The range of chemical properties sets a challenge to the analytical tools both for profiling multiple metabolites in parallel and for quantitatively analyzing the selected ones. This has especially become obvious in secondary metabolite analysis, which is far more complex than metabolite profiling of primary metabolites. Metabolites have very different chemical natures, which influence their extractability in various solvents, pH requirements, and sensitivity for extraction conditions (e.g., temperature, pressure, time). As a consequence, if applying one general extraction system, it is very likely that many metabolites remain in the plant matrix and cannot be profiled. On the other hand, if the accuracy of the extraction systems is increased, fewer metabolites are analyzed. One of the key challenges of metabolite profiling therefore is finding an optimal balance between the accuracy and coverage of metabolite measurements. This can be achieved by first splitting the metabolomics platform into subgroups of compounds sharing common chemical properties from the perspective of extraction conditions and chromatography (Fig. 1). A similar strategy has already been proposed in the domain of drug discovery (11). Advances in instrumentation for metabolite analyses are empowering us with the ability to increase the coverage of metabolites within a single analysis. For example, Hirai et al. (6) used the Fourier transform–ion cyclotron MS, with mass resolution 100,000 and accuracy 1 ppm, which enables analysis and separation of complex metabolite mixtures based on isotopic mass alone. More commonly, GC MSand liquid chromatography MS-based approaches have been applied in plant metabolomics applications (10). Following analytical measurements, the role of data processing algorithms is to detect the peaks in spectral data, match the corresponding peaks across multiple samples, and correct the peak intensities caused by instrumental variability (normalization). These methods enable us to track differentially the metabolite levels across multiple environmental conditions or time points, even if some of the compound identities are not known. Although progress has been made in our ability to differentially track large numbers of peaks (11, 12), advances are still needed to integrate spectral analysis with prior (compound) information and improve quantitative estimates by combining traditional analytical approaches with new
منابع مشابه
Identification of Prognostic Genes in Her2-enriched Breast Cancer by Gene Co-Expression Net-work Analysis
Introduction: HER2-enriched subtype of breast cancer has a worse prognosis than luminal subtypes. Recently, the discovery of targeted therapies in other groups of breast cancer has increased patient survival. The aim of this study was to identify genes that affect the overall survival of this group of patients based on a systems biology approach. Methods: Gene expression data and clinical infor...
متن کاملIntroducing Critical Pain-related Genes: A System Biology Approach
Introduction: Pain is valuable in diagnosis and also warning of the patients. Many molecular reagents are introduced which are related to pain. In this research, the pain-related genes are screened to identify the critical ones. Methods: First, the pain-related genes were pulling out from the STRING database, and Cytoscape software was used to make the interactome unit. Then the central genes...
متن کاملA network-based method for target selection in metabolic networks
MOTIVATION The lack of new antimicrobials, combined with increasing microbial resistance to old ones, poses a serious threat to public health. With hundreds of genomes sequenced, systems biology promises to help in solving this problem by uncovering new drug targets. RESULTS Here, we propose an approach that is based on the mapping of the interactions between biochemical agents, such as prote...
متن کاملIn silico methods for linking genes and secondary metabolites: The way forward
In silico methods for linking genomic space to chemical space have played a crucial role in genomics driven discovery of new natural products as well as biosynthesis of altered natural products by engineering of biosynthetic pathways. Here we give an overview of available computational tools and then briefly describe a novel computational framework, namely retro-biosynthetic enumeration of bios...
متن کاملPrevalence of Tetracycline Resistance Genes tet (A, B, C, 39) in Klebsiella pneumoniae Isolated from Tehran, Iran by Multiplex PCR
Background and Objective: Klebsiella pneumoniae is one of the three pathogens that has become a global disease control and treatment problem due to its resistance to common antibiotics. For this reason, it is crucial to study the genes that cause antibiotic resistance in it. Therefore, the aim of this study was to investigate the phenotypic and genotypic frequency of tetracycline resistance in ...
متن کاملPapaya Dieback in Malaysia: A StepTowards A New Insight of Disease Resistance
A recently published article describing the draft genome of Erwiniamallotivora BT-Mardi (1), the causal pathogen of papaya dieback infection in Peninsular Malaysia, hassignificant potential to overcome and reduce the effect of this vulnerable crop (2). The authors found that the draft genome sequenceis approximately 4824 kbp and the G+C content of the genomewas 52-54%, which is very similarto t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Proceedings of the National Academy of Sciences of the United States of America
دوره 101 27 شماره
صفحات -
تاریخ انتشار 2004